fast sffs-based algorithm for feature selection in biomedical datasets

Authors

f. shirbani

h. soltanian zadeh

abstract

biomedical datasets usually include a large number of features relative to the number of samples. however, some data dimensions may be less relevant or even irrelevant to the output class. selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. to this end, this paper presents a hybrid method of filter and wrapper feature selection that takes advantage of a modified method of sequential forward floating search (sffs) algorithm. the filtering approach evaluates the features for predicting the output and complementing the other features. the candidate subset generated by the filtering approach is used by k-fold cross validation of support vector machine (svm) with user-defined classification margin as a wrapper. applications of the proposed sffs method to five biomedical datasets illustrate its superiority in terms of classification accuracy and execution time relative to the conventional sffs method and another previously improved sffs method.

Upgrade to premium to download articles

Sign up to access the full text

Already have an account?login

similar resources

Fast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets

Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...

full text

Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets

Objective(s): This study addresses feature selection for breast cancer diagnosis. The present process uses a wrapper approach using GA-based on feature selection and PS-classifier. The results of experiment show that the proposed model is comparable to the other models on Wisconsin breast cancer datasets. Materials and Methods: To evaluate effectiveness of proposed feature selection method, we ...

full text

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

full text

Measuring Stability of Feature Selection in Biomedical Datasets

An important step in the analysis of high-dimensional biomedical data is feature selection. Typically, a feature subset selected by a feature selection method is evaluated for relevance towards a task such as prediction or classification. Another important property of a feature selection method is stability that refers to robustness of the selected features to perturbations in the data. In biom...

full text

feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets

objective(s): this study addresses feature selection for breast cancer diagnosis. the present process uses a wrapper approach using ga-based on feature selection and ps-classifier. the results of experiment show that the proposed model is comparable to the other models on wisconsin breast cancer datasets. materials and methods: to evaluate effectiveness of proposed feature selection method, we ...

full text

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

full text

My Resources

Save resource for easier access later


Journal title:
amirkabir international journal of electrical & electronics engineering

Publisher: amirkabir university of technology

ISSN 2008-6075

volume 45

issue 2 2013

Hosted on Doprax cloud platform doprax.com

copyright © 2015-2023